Joint Anchor-Feature Refinement for Real-Time Accurate Object Detection in Images and Videos
نویسندگان
چکیده
Object detection has been vigorously investigated for years but fast accurate real-world scenes remains a very challenging problem. Overcoming drawbacks of single-stage detectors, we take aim at precisely detecting objects static and temporal in real time. Firstly, as dual refinement mechanism, novel anchor-offset is designed, which includes an anchor refinement, feature location deformable head. This new mode able to simultaneously perform two-step regression capture object features. Based on the detection, network (DRNet) developed high-performance where multi-deformable head further designed leverage contextual information describing objects. As videos, networks (TRNet) (TDRNet) are by propagating across We also propose soft strategy temporally match motion with previous refinement. Our proposed methods evaluated PASCAL VOC, COCO, ImageNet VID datasets. Extensive comparisons verify superiority DRNet, TRNet, TDRNet. Consequently, our approaches run fairly speed, meantime achieve significantly enhanced accuracy, i.e., 84.4% mAP VOC 2007, 83.6% 2012, 69.4% 2017, 42.4% AP COCO. Ultimately, producing encouraging results, applied online underwater grasping autonomous system. Codes publicly available https://github.com/SeanChenxy/TDRN.
منابع مشابه
Interactive Feature Growing for Accurate Object Detection in Megapixel Images
Automatic object detection in megapixel images is quite inaccurate and a time and memory expensive task, even with feature detectors and descriptors like SIFT, SURF, ORB, and KAZE. In this paper we propose an interactive feature growing process, which draws on the efficiency of the users’ visual system. The performance of the visual system in search tasks is not affected by the pixel density, s...
متن کاملAlgorithms for Real-Time Object Detection in Images
The field of Computer Vision (CV) is still in its infancy. It has many real-world applications, and many breakthroughs are yet to be made. Most of the companies in existence today that have products based on CV can be divided into three main categories: auto manufacturing, computer circuit manufacturing, and face recognition. There are other smaller categories of this field that are beginning t...
متن کاملReal-Time Feature Matching for the Accurate Recovery of Augmented-Reality Display in Laparoscopic Videos
Augmented-Reality (AR) displays increase surgeon’s visual awareness of high-risk surgical targets (e.g., the location of a tumor) by accurately overlaying pre-operative radiological 3-D model onto the intra-operative laparoscopic video. Existing AR systems are not robust to sudden camera motion or prolonged occlusions, which can cause the loss of those anchor points tracked along the video sequ...
متن کاملObject Detection in Real Images
Object detection and recognition are important problems in computer vision. Since these problems are meta-heuristic, despite a lot of research, practically usable, intelligent, real-time, and dynamic object detection/recognition methods are still unavailable. We propose a new object detection/recognition method, which improves over the existing methods in every stage of the object detection/rec...
متن کاملReal-Time Automatic Linear Feature Detection in Images
Linear feature detection in digital images is an important low-level operation in computer vision that has many applications. In remote sensing tasks, it can be used to extract roads, railroads, and rivers from satellite or low-resolution aerial images, which can be used for the capture or update of data for geographic information and navigation systems. In addition, it is useful in medical ima...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Circuits and Systems for Video Technology
سال: 2021
ISSN: ['1051-8215', '1558-2205']
DOI: https://doi.org/10.1109/tcsvt.2020.2980876